This article explains that how to extract HTML formatted text from document page in Java....Watermark Product Solution GroupDocs...welcome to extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
This code snippet demonstrates how to extract information about known properties that can be encountered in a particular package.
Load a file to examine Get a collection of PropertyDescriptor instances for any desired metadata package Iterate through the extracted descriptors advanced_usage.GettingKnownPropertyDescriptors
try (Metadata metadata = new Metadata(Constants.InputDoc)) { WordProcessingRootPackage root = metadata.getRootPackageGeneric(); for (PropertyDescriptor descriptor : root.getDocumentProperties().getKnowPropertyDescriptors()) { System.out.println(descriptor.getName()); System.out.println(descriptor.getType()); System.out.println(descriptor.getAccessLevel()); for (PropertyTag tag : descriptor.getTags()) { System.out.println(tag); } System.out.println(); } } Note Not all possible properties are presented in the getKnowPropertyDescriptors collection....Watermark Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
Detecting the GIF version The following sample of code will help you to detect the version of a loaded GIF image and extract some additional file format information.
Load a GIF image Extract the root metadata package Use the FileType property to obtain file format information AdvancedUsage.ManagingMetadataForSpecificFormats.Image.Gif.GifReadFileFormatProperties
using (Metadata metadata = new Metadata(Constants.InputGif)) { var root = metadata.GetRootPackage(); Console.WriteLine(root.FileType.FileFormat); Console.WriteLine(root.FileType.Version); Console.WriteLine(root.FileType.ByteOrder); Console.WriteLine(root.FileType.MimeType); Console.WriteLine(root.FileType.Extension); Console.WriteLine(root.FileType.Width); Console.WriteLine(root.FileType.Height); } Working with XMP Metadata GroupDocs....Watermark Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
This article explains how to load document of specific format....Watermark Product Solution GroupDocs...GroupDocs.Watermark Product Family / GroupDocs.Watermark for .NET...
This code snippet demonstrates how to extract information about known properties that can be encountered in a particular package.
Load a file to examine Get a collection of PropertyDescriptor instances for any desired metadata package Iterate through the extracted descriptors advanced_usage.GettingKnownPropertyDescriptors
JavaScript const metadata = new groupdocs.metadata.Metadata("input.doc"); var root = metadata.getRootPackageGeneric(); var descriptors = root.getDocumentProperties().getKnowPropertyDescriptors(); for(var i=0;iWatermark Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
In some cases it’s required to specify the document format manually to guarantee correct output produced by GroupDocs.Parser. The following are the cases when the document format must be specified manually:
Markdown documents MHTML documents OTP documents (OpenDocument Presentation Template) Databases Emails from remote servers Here are the steps to specify the document format for Markup document.
Instantiate the LoadOptions object and pass the document format in LoadOptions(FileFormat) constructor; Create Parser object and call any method....Watermark Product Solution GroupDocs...welcome to extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
This article shows how to get the basic document info....Watermark Product Solution GroupDocs...welcome to extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
It supports DOCX, DOCM, DOC, DOT, DOTM, XLS, XLSX, Pdf, PPT, JPG, PNG, HTML, EML and many more....Watermark Product Solution GroupDocs...Filters Saves the document as PDF (Adobe Portable Document) format...
GroupDocs.Metadata allows users to get meta information of a document....Watermark Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
This article explains how to provide advanced options when updating Text electronic signatures with GroupDocs.Signature API....Watermark Product Solution GroupDocs...IsSignature Image Spreadsheet Pdf Presentation Word Processing...